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The Identification of the influential nodes in networks is one of the most promising domains. In this paper, we 
present an improved iterative resource allocation (IIRA) method by considering the centrality information of 
neighbors and the influence of spreading rate for a target node. Comparing with the results of the Susceptible 
Infected Recovered (SIR) model for four real networks, the IIRA method could identify influential nodes 
more accurately than the tradition IRA method. Specially, in the Erdos network, the Kendall’s tau could be 
enhanced 23% when the spreading rate is 0.12. In the Protein network, the Kendall’s tau could be enhanced 
24% when the spreading rate is 0.08. 

PACS numbers: 89.20.Hh, 89.75.Hc, 05.70.Ln 


I. INTRODUCTION 

Spreading is a ubiquitous phenomena in nature. A lot 
of activities can be seen as spreading in society^ - — . In 
the past few years, the spreading in complex networks is 
concerned more and more with its great theoretical signif¬ 
icance and remarkably practical value, that is, epidemic 
controlling^—, information dissemination^ and viral mar¬ 
keting. One of the fundamental problems is to identify 
the influential nodes in the networks. The knowledge of 
the node’s spreading ability shows new insights for appli¬ 
cations such as identifying influential nodes^r— , design¬ 
ing efficient methods to either hinder epidemic spreading 
or accelerate information dissemination. 

Recently, there are a lot of centrality methods^ have 
been applied to identify the influential nodes in com¬ 
plex networks, including degree, eigenvector centrality^, 
closeness centrality^ and fc-shell decomposition^. The 
degree centrality is based on the number of neighbors 
connected with a node. Chen et ahU defined a local cen¬ 
trality based on the degree information of nearest neigh¬ 
bors and the second nearest neighbors. Poulin et al 
proposed cumulated nomination centrality based on the 
iterative method for solving the feature vector mapping. 
Zhang et alr^ proposed a multiscale measurement by 
considering the interactions from all the paths a node 
is involved. Kitsak et al— found that the most influ¬ 
ential nodes are those located within the core of the 
network by decomposing a network with the fc-shell de¬ 
composition method. By taking into account the neigh¬ 
bors’ k -core values, Lin et al.~ proposed an improved 
neighbors’ k -core (INK) method to identify the influential 
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nodes with the largest k -core values. And by considering 
the fc-shell decomposition method and resource iteration, 
Ma et proposed an improved method to identify 
the influential nodes. In directed networks, some meth¬ 
ods are also designed to identify the influential spread¬ 
ers namely Leader Rank? 2 , which outperforms the well- 
known PageRank method in both effectiveness and ro¬ 
bustness. 



FIG. 1. (Color online) An example network consists of 8 
nodes and 9 edges. We employ the SIR model to simulate the 
spreading process for these nodes. Initially, only one node 
is selected to be infected. For each node, the node spread¬ 
ing influence is defined as the number of the infected nodes, 
then the spreading influence of nodes 1, 2 is 1.079, 1.083 re¬ 
spectively. The result is obtained by averaging over 10000 
independent runs and 3 time steps when the spreading rate 
P is 0.07. In the IRA method, the value of nodes 1, 2 which 
generated by the IRA method is 0.33, 0.29 respectively. 

These existing methods mainly consider the signifi¬ 
cance of node, but the node influence is also affected 
by the importance of its neighbors. Based on this 
idea, Renet alr^ proposed an iterative resource allocation 
(IRA) method to identify influential nodes. However, the 
number of infected neighbor also affect the resource al¬ 
location. Take Fig. 1 as example, the IRA method can 
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not distinguish the influence of the nodes 1 and 2 accu¬ 
rately. Therefore, we argue that either the number of 
neighbors and the spreading rate may affect the prop¬ 
erty of the target nodes simultaneously. Inspired by this 
idea, we present an improved iterative resource alloca¬ 
tion method (HRA), where the resource allocation of the 
target node may adjust by the spreading rate. Then, 
comparing with the Susceptible Infected Recovered (SIR) 
spreading proces s 24 ’ 25 for four real networks, the results 
show that the ranking list generated by our method could 
identify influential nodes more accurately than the one 
generated by the IRA method. 
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Equation (1) can be written in matrix form: 
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where the element a^ of matrix A is given by 
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II. THE IIRA METHOD 

The traditional IRA method supposes that the node in¬ 
fluence is determined by neighbors’ centralities, such as 
the degree centrality, fc-shell method, closeness central¬ 
ity, betweenness centrality and so on. There is a tunable 
parameter to nonlinearly adjust the weight of the cen¬ 
trality. The IIRA method also requires the traditional 
node centralities as the input, such as degree centrality, 
fc-shell method^, closeness centrality^ and eigenvector 
centrality^. In the IIRA method, we argue that each 
node has a initial resource in the initial state, and the 
spreading rate could adjust the allocation of the resource 
which determined by the neighbors’ centralities. After a 
certain number of iterations, the resource of each node 
will approach a steady state and the final amount of re¬ 
source in each node will be used to identify the influential 
nodes. The IIRA method can be described as follows in 
detail. 

We consider that an undirected network G = ( N , E) 
with N nodes and E edges could be described by an 
adjacent matrix H = {Ay} G R n ’ n where 5 l:] = 1 if node 
i is connected by node j, and Aj = 0 otherwise. At 
each iteration step, each node could allocate the resource 
determined by the node centrality to it’s neighbors, and 
the resource allocation is adjusted by the spreading rate. 
Let T(?) be set of node Vs neighbors, the resource of 
node i obtained by resource allocation can be expressed 
as follows: 

Ij(i+1)= Rj^i(t + 1) 


aij = i m ( 4 ) 

\«er(jj J 

We assume that each node has a initial resource(I(0) = 
(1,1, • • • , 1) T ). The ultimate resource of each node will 
reach a new state(I(f) = AI(£ — 1) = A f I(0)). Accord¬ 
ing to the Gersligorin disk theorem^, the spectral radius 
p(A) of matrix A is no larger than 1, and the vector 1(f) 
will converge to a steady value after t times iterations. 
However, the t is not infinite. Then the ranking list gen¬ 
erated by the 1(f) can identify the influential nodes. 



FIG. 2. (Color online) Example of networks with node N = 
5. the node label is 1, 2, 3, 4, 5, respectively. fcsi, 2,3 = 2, 

ks 3,4 = 1. 

For example, a network with 5 nodes and 5 edges is 
shown in Fig. 2. The initial resource of each node equals 
1(1(0) = (1,1,1,1,1) T ). We take fc-shell as the node 
centrality(d = ks as an example) and set spreading rate 
/3 = 0.2. The resource allocation matrix is as follows: 


je r(») 


= E 

jer(i) 


/ \ -1 

(1) 


( 0 

0.29 

0.29 

0.59 

0.59 

\ 

tMi j E (>v ) 


0.12 

0 

0.18 

0 

0 


A = 

0.12 

0.18 

0 

0 

0 


\uer(j) J 



0.03 

0 

0 

0 

0 





\ 0.03 

0 

0 

0 

0 

/ 


( 5 ) 


where Rj^iit + 1) represents that the node j allocates its 
resource to node i at step f + 1, Ij(t) represents the node 
j’s resource at step f, 6t is the centrality of the node i, 
ipi reflects the influence of change spreading rate of the 
node i, j3 is the spreading rate, and the hi is the degree 
of node i, ki can be expressed as 


To make the final resource allocation value convergent,we 
set iterations times f = 50, 1(50) = A 5O I(0) = [8.19e — 
20,4.32e - 20,4.32e - 20,6.7e-21,6.7e- 21] T . The result 
shows that the IIRA method can estimate the influence 
of the target node. 
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III. EXPERIMENT RESULTS 

A. Data description 

In this paper, we evaluate the performance of the IIRA 
method in four networks including the Erdos, US air line, 
Email and Protein networks. The Erdos network^ is a 
scientific collaboration networks. Each node represents 
the scientist whose Erdos number is I and the edge repre¬ 
sents the cooperative connection between each pair of sci¬ 
entists. The US air line network^ is the part of air traffic 
lines in America. Each node represents the city and the 
edge represents the airline between the city and the other 
city. The Email network^ is the network that each node 
represents different people including researchers, techni¬ 
cians, managers, administrators and graduate students. 
The edge between the two nodes indicates that these peo¬ 
ple keep a communication relation with each other. The 
Protein network^ represents the interaction between the 
various proteins. 

The statistical properties of four real networks are 
shown in Table I, including the number of nodes N, the 
number of edges E, the average degree (k) and the epi¬ 
demic threshold j3 c . 

TABLE I. Basic statistical features of the Erdos, US air line, 
Email and Protein networks, including the number of nodes 
N, edges E, the average degree ( k ), the spreading threshold 
Pc 


Network 

N 

E 

(k) 

dc 

Erdos 

474 

1639 

6.916 

0.055 

US air line 

332 

2126 

12.807 

0.021 

Email 

1133 

5451 

9.622 

0.050 

Protein 

2284 6646 

5.820 

0.052 


B. Measurement 

In this paper, we use the SIR mode l 24 ’ 25 to examine 
the node spreading influence. In such a system, there 
are three compartments: (i) Susceptible individuals rep¬ 
resent the number of individuals susceptible to (not yet 
infected) the disease; (ii) Infected individuals represent 
individuals who have been infected and are able to spread 
the disease to susceptible individuals; (iii) Recovered in¬ 
dividuals represent individuals who have been recovered 
and will never be infected again. At each time step, 
for each infected node, one randomly selected suscep¬ 
tible neighbor gets infected with the spreading rate /?, 
and the infected node would recover in one time step. 
The number of infections X , generated by the initially- 
infected node including the initially-infected is denoted 
as its spreading influence, where /3 is the spreading rate in 
the SIR model. The number of infections X is obtained 


by averaging over 20000 independent runs and the time 
step T is set as 5. 

To check the performance of the IIRA method, the 
Kendall’s tau^U r is introduced to measure the correlation 
of the node spreading influence with the IIRA method 
and the IRA method. Kendall’s tau r is used to measure 
the correlation between two ranking list. The Kendall’s 
tau r value lies in [-1,1], and the increasing values im¬ 
ply the method can identify the influential nodes more 
accurately. The Kendall’s tau r is defined as 

r = N(N- 1) - x j)(Vi - Vj)\> ( 6 ) 

where N is the number of nodes in a network, Xi is 
the spreading influence of node i, yi are the values of 
the IIRA method which generated by the degree, close¬ 
ness, fc-shell and eigenvector centralities. The sgn(cc) is 
a piecewise function, when x > 0, sgn(i)= +1; x < 0, 
sgn(a;)= —1; When x = 0, sgn(a;)= 0. 

C. Simulation results 

In this section, we first verify the effectiveness of the 
IIRA method. Figure 3 shows Kendall’s tau r of the 
IIRA method when the iteration times t changes from 1 
to 100. We can find that each Kendall’s tau r gradually 
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FIG. 3. (Color online) The Kendall’s tau r obtained by com¬ 
paring the ranking list generated by the SIR spreading pro¬ 
cess and the ranking lists generated by the fc-shcll (squares), 
degree (circles), closeness (triangles) and eigenvector (dia¬ 
monds) centralities. The iteration times t lies (0,100], and 
the spreading rate /3 = 0.05. 

achieve stability with increasing of t times iterations. 
However, in the four networks, each t is different when 
the Kendalls tau r achieve stability. But we can see that 
every Kendall’s tau r approach a steady value when the 
iterations times t is no larger than 50. We think that 
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the IIRA method can identify the influential nodes effec¬ 
tively when the iterations times t = 50. So we set the 
iteration times of the IIRA method t = 50. 

And then, we evaluate the performance of the IIRA 
method in the four networks. We use relatively small 
values of 0 in SIR model, namely 0 £ (0,0.2]. Figure 
4 shows Kendall’s tau r of the IIRA method where the 
ranking lists are generated by the fc-shell, degree, close¬ 
ness and eigenvector centralities. From which, one can 
find that there are different performances for the differ¬ 
ent centralities. For instance, in the Erdos network, when 
the spreading rate 0 is lower than 0.08, the Kendall’s tau 
r generated by the eigenvector centrality is lower than 
the ones generated by other centralities. However, when 
the spreading rate 0 is larger than 0.08, the Kendall’s 
tau t generated by the eigenvector centrality is much 
larger than other centralities. It means that the ranking 
list generated by the eigenvector centrality could identify 
the influential nodes more accurately than other central- 



FIG. 4. (Color online) The Kendall’s tau r obtained by com¬ 
paring the ranking list generated by the SIR spreading pro¬ 
cess and the ranking lists generated by the fc-shcll (squares), 
degree (circles), closeness (triangles) and eigenvector (dia¬ 
monds) centralities. The results are averaged over 20000 in¬ 
dependent runs with different spreading rate 0. 

ities. In the US air line network, the Kendall’s tau r 
of the IIRA method where the ranking list generated by 
the eigenvector centrality is much larger than ones gen¬ 
erated by other centralities when the spreading rate 0 
lies between 0.02 and 0.08. And in the Email network, 
the Kendall’s tau r of the IIRA method where the rank¬ 
ing list generated by the eigenvector centrality is much 
larger than ones generated by other centralities when the 
spreading rate 0 lies between 0.06 and 0.1. The same 
phenomena could be found for the Protein network when 
the spreading rate 0 lies between 0.07 and 0.13. 

Figure 4 also shows that, in the four networks, the 
Kendall’s tau r of the IIRA method where the ranking 
list generated by the eigenvector centrality firstly increase 
and then decrease with the increasing of the spreading 


rate 0. In the Erdos, Email and Protein networks, the 
Kendall’s tau r of the IIRA method where the ranking 
lists generated by the fc-shell, degree and closeness cen¬ 
tralities firstly increase and then decrease, and will in¬ 
crease again after the drop. Particularly, the Kendall’s 
tau r generated by the /e-shell, degree and closeness cen¬ 
tralities will increase again after the drop in the US air 
line, Email and Protein networks. 
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FIG. 5. (Color online) The improved ratio tj of different 
spreading rates 0 with different centralities including fc-shell, 
degree, closeness and eigenvector centralities in the Erdos, US 
air line, Email and Protein networks. 

Figure 5 reports the improved ratio i] of the Kendall’s 
tau r generated by the IIRA method comparing with 
the results of the IRA method. The improved ratio 77 is 
defined as 


new _ j.0 

1 = T o . m 

where T new is the Kendall’s tau r of the IIRA method by 
considering the degree, fc-shell, closeness and eigenvector 
centralities, and r° is the Kendall’s tau r obtained by the 
IRA method. Clearly, 77 > 0 indicates an advantage of 
the IIRA method. The improved ratios in r for the IRA 
method where the ranking lists generated by the fc-shell, 
degree, closeness and eigenvector centralities with differ¬ 
ent spreading rate 0 are shown in Fig. 5. In the Erdos 
network, the largest improved ratio 77 generated by the 
closeness centrality could reach 22 % when the spreading 
rate 0 is 0.12. In the US air line network, the largest 
77 generated by the closeness centrality could reach 12 % 
when the spreading rate 0 is 0.03. The same phenom¬ 
ena could be found for the Email and Protein networks, 
the improved ratio 77 generated by the closeness centrality 
could reach the largest value when the spreading rate 0 is 
0.05 and 0.08 respectively. In the Erdos and Protein net¬ 
works, when the spreading rate 0 lies between 0.04 and 
0 . 2 , the improved ratios 77 generated by the four kinds 
of centralities are larger than 0 which indicates that the 
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IIRA method could identify the influential nodes more 
accurately than the IRA method. In the US air line net¬ 
work, the improved ratios 77 generated by the closeness 
and eigenvector centralities are larger than 0. The im¬ 
proved ratio 77 generated by the closeness centrality is 
larger than the one generated by the eigenvector central¬ 
ity, it means that the ranking list of the IIRA method 
generated by the closeness centrality is more accurate 
than the one generated by the eigenvector centrality. 

Furthermore, we find the dependence of improved ra¬ 
tio 77 on the spreading rate /3 in Fig. 5. In the Erdos, 
Email and Protein networks, we find that the improved 
ratio 77 generated by the fc-shell centrality has two dis¬ 
tinct trends with the increasing of the spreading rate /3. 
The improved ratio 77 firstly increase and then decrease 
gradually. There are the same trends for the degree and 
closeness centralities. In the Erdos and Protein networks, 
the improved ratios 77 generated by the /c-shcll, degree, 
closeness, eigenvector centralities are larger than 0 when 
the spreading rate /3 is larger than 0.05 which indicates 
that the IIRA method could identify the influential nodes 
more accurately than the IRA method. In the US air line 
network, the improved ratio 77 generated by the closeness 
centrality improve more obviously than the ones gener¬ 
ated by other centralities which indicates that the IIRA 
method generated by the closeness centrality could iden¬ 
tify the node spreading influence more accurately than 
the IRA method. The same phenomena could be found 
for the Email network, which indicates that the ranking 
accuracy of the IIRA method generated by the closeness 
centrality is much better than the IRA method. 


IV. CONCLUSION AND DISCUSSIONS 

By taking into account the neighbors’ resource of the 
node and the influence of spreading rate for the target 
node, we present an improved iterative resource alloca¬ 
tion (IIRA) method to identify the node spreading influ¬ 
ence. The IIRA method considers the infection status of 
target node, and the probability of infection determined 
by the degree of target node. And the new resource allo¬ 
cation is determined by these information. It can be ap¬ 
plied to many classical centralities such as degree, fc-shell, 
closeness, eigenvector centralities. The simulation results 
show that the performance of the IIRA method can be 
further improved than the IRA method without adding 
any other parameters and computation complexity. In 
the four networks, the performance of the improved ra¬ 
tio 77 generated by the closeness centrality is better than 
the ones generated by other centralities. Specially, in 
the Erdos network, the largest improved ratio 77 gener¬ 
ated by the closeness centrality could reach 22 % when 
the spreading rate /3 is 0.12. In the Protein network, 
the largest improved ratio 77 generated by the closeness 
centrality could reach 24% when the spreading rate /3 is 
0.08. These results show that the IIRA method could 
identify the influential nodes more accurately than the 


IRA method. 

In the IIRA method, the final resource of a node is not 
only determined by its neighbors’ centralities but also de¬ 
pended on the number of infected neighbors. It would be 
very interesting to test the modified centrality in other 
dynamic process. For example, the IIRA method only 
considers the influence of spreading rate of target node. 
Whether the influence of target node’s neighbors should 
be considered. It also should be noticed that the IIRA 
method operation requires iterative process, which is very 
time-consuming. So, we should find a feasible method to 
reduce the computational complexity. In addition, in¬ 
terconnected networks have attracted more and more at¬ 
tention recently. How to design a new iterative resource 
allocation method in these networks may be an interest¬ 
ing and important open problem. 
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